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AN  APPROXIMATION  FOR  THE  DISTRIBUTION  OF  THE  WILCOXON 
ONE-SAMTLE  STATISTIC 

D  D  O 

By  Winston  K.  Chow*  and  J.  L.  Hodges,  Jr.  rpy2rr'r*7,r'  HIE 
University  of  California,  Berkeley  ^OV  ^ 

JljibdyEUU'El 

1.  Introduction  ® 

In  19*15  Frank  Wilcoxon  suggested  a  statistic  that  may  be  used 

to  test  the  location  of  a  continuous  symmetric  population.  Without 

loss  of  generality,  we  suppose  that  the  null  hypothesis  puts  the 

center  of  symmetry  at  zero,  and  that  X^,X2,,#,,Xn  are  observations 

drawn  from  the  population.  Rank  these  observations  in  order  of 

increasing  absolute  value,  and  attach  to  each  rank  the  sign  of  the 

corratponding  X^.  There  are  2n  possible  patterns  of  signs,  and 

under  the  null  hypothesis  each  pattern  has  the  same  probability  — . 

2n 

Let  VI  denote  the  sum  of  the  ranks  with  negative  sign.  A  small 
value  of  W  constitutes  evidence  that  the  population  center  lies 
to  the  right  of  zero. 

The  Wilcoxon  test,  based  on  W,  is  attractive  in  several 
ways.  The  statistic  itself  is  easy  to  compute.  Its  null  distribu¬ 
tion,  which  requires  only  the  counting  of  the  number,  say  #(w) , 
of  sign  patterns  giving  W  "  w,  provides  exact  significance 
probabilities  without  requiring  any  assumption  of  a  parametric 
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form  for  the  population  (see  Section  2).  In  normal  samples,  the 
Wilc'fxon  test  has  the  high  efficiency  3h  ■  .955  In  the  Unit 
as  n  -*  »  ,  relative  to  the  t-test,  and  Klotz  found  In  1963 
[3]  that  the  efficiency  Is  also  high  for  normal  samples  of  sizes 
5  ^  n  -  10  provided  a  Is  not  too  small.  For  populations  similar 
to  the  normal  but  with  heavier  tails ,  which  Is  perhaps  a  typical 
situation  In  practical  work,  the  large-sample  efficiency  of 
’•Jilcoxon  relative  to  t  can  be  arbitrarily  greater  than  one. 

Because  cl’  the  Importance  of  the  Wilcoxon  test,  It  is 
desirable  to  be  able  to  find  out  something  about  the  distribution 
of  W  in  various  circumstances.  For  example:  Does  the  high 
efficiency  in  the  normal  case  hold  not  only  for  n  S  10  and 
n  -♦  »,  but  also  for  moderate  Intermediate  values?  Does  the 
asymptotic  Insensitivity  of  W  to  heavy  tails  hold  also  when  n 
is  small?  How  robust  is  the  significance  probability  against 
moderate  departures  from  symmetry? 

For  each  of  these  and  many  similar  questions,  we  need  to  find 
the  distribution  of  W  under  the  assumption  that  *  *  *  »*n 

are  drawn  from  a  population  with  distribution  G  which  is  not 
symmetric  about  zero.  As  reviewed  below,  this  distributional 
question  has  not  proved  easy.  We  offc>:  here  a  method  of  approxima¬ 
tion  which  seems  to  be  useful,  at  least  in  some  cases,  and  then 
use  it  to  throw  some  light  on  the  specific  questions  asked  above. 
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It  it  easy  to  tea  that  what  la  said  also  applies,  with  appro¬ 
priate  modifications,  to  another  Important  use  for  the  Wilcoxon 
test-statistic— to  test  the  absence  of  treatment  effect  In  a 
matched-pairs  design.  In  that  case,  X^  represents  the  observed 
difference  in  response  between  treated  and  control  subject  in  the 
1th  pair.  The  random  assignment  of  treatments  gives  W  its  null 
distribution.  If  we  imagine  a  population  of  pairs  In  which  X 
has  the  distribution  G,  the  questions  of  power  and  robustness 
may  also  be  asked  in  this  case.  We  shall  however  for  simplicity 
couch  our  work  in  terms  of  the  one-sample  problem. 

2.  The  null  distribution 

Under  the  null  hypothesis,  P(W  •  w)  «  /'(w) /2n.  The  range 
of  W  is  from  0  to  w^.  -  £n(n+l)  ,  and  the  distribution  is 
symmetric  about  £n(n+l) .  The  function  #  ,  which  is  needed  in 
our  approximation,  may  be  computed  by  means  of  recursion  on  n. 
Tables  J  and  K  of  [2],  proviie  all  values  of  #(w)  for 
n  3  12;  and  for  w  S  n  +  30  where  n  ranges  between  13  and  20, 
inclusive.  For  future  reference,  we  show  the  values  of  #  for 
n  -  10  in  Table  1.  For  w  3  n,  values  of  #  can  also  be  read 
from  the  partition  function  q  given  in  Table  24.5  of  [7]. 

In  case  of  large  n,  and  w  >  n,  the  Edgeworth  approximation 


leads  to  the  approximation 


h 

(2.1)  #(w)  -  2n  •  P(W  -  w) 

-  2n  •  <p(z)  •  ( 1  -  3(u2  -  6u  +  3)  /10 •  (2n+l) 

+  3(u3  -  15u2  +  45u  -  15)  /3t>n2 

+  9(u*  -  20u3  +  2l0u2  -  420u  +  105) /800  n2)/o 

2  2 

where  o  «  n(n+l)  (2n+l)  /4  ,  z  ■  [w  -  £  n(n+l)  ]/o  and  u  -  z  . 

Formula  (2.1)  has  relative  error  of  order  1/n  ,  and  Table  2 
shows  that  it  gives  excellent  results  at  the  limits  of  the  exact 
tables  described  above. 

3.  Distribution  of  W  by  numerical  Integration 

Numerical  integration  gives  a  straightforward  method  of 
finding  the  distribution  of  W  for  a  given  G.  Suppose  the 
continuous  G  has  a  density  g,  which  would  be  so  in  nearly  all 
cases  of  interest.  By  Independence,  the  joint  density  of  the 
sample  is  g(x1)g(x2)  •  •  *g(xn) .  The  n-dimensional  space  may  be 
divided  into  2n  regions  corresponding  to  the  2n  patterns  of 
signs  for  the  absolute  values.  Evaluation  of  the  integrals  of 
this  Joint  density  over  these  2n  regions  will  give  probabilities, 
appropriate  sums  of  which  will  give  the  distribution  P(W  *  w) . 

An  effective  iterative  scheme  for  evaluating  these  integrals  was 
devised  by  Klotz  [7],  who  in  1963  published  results  for  normal 
samples  with  5  *  n  *  10.  We  are  grateful  to  Professor  Klotz  for 
supplying  us  with  his  original  results,  only  part  of  which  have 


s 


5 

hoen  published.  His  values  for  n  ■  10  end  <Jf(l,l)  ere  shown 
in  Che  third  column  of  Teble  4.  Professor  Klotz  points  out  thet 
the  sun  of  his  1024  integrals  is  1.0003,  indicating  that  the  fourth 
decimal  piece  is  not  quite  reliable.  The  Klotz  iterative  formulas 
were  used  by  Arnold  [1]  in  1965  to  obtain  results  for  t-dlstrlbutlau 
with  the  same  sample  sizes. 

While  the  Integration  approach  is  straight  forward ,  it  rapidly 
becomes  very  expensive  as  n  Increases.  Not  only  does  the  number 
of  regions  increase  exponentially  with  n,  but  even  worse  is  the 
increase  in  dimensionality.  (The  difficulty  of  accurate  integration 
in  higher-dimensional  space  is  discussed  in  detail  by  Milton  [6] 
for  the  related  two-sample  problem.)  If  one  wishes  to  survey  a 
wide  range  of  sample  sizes  and  distributional  shapes,  integration 
does  not  appear  to  be  a  practical  approach.  Experience  with  other 
tests  suggests  that  some  sort  of  approximation  is  1  kely  to  be 
helpful  as  a  supplement  to  the  small-sample  calculation.  Fortu¬ 
nately,  it  is  not  difficult  to  find  moments  of  W  for  use  with 
approximations  to  its  distribution, 

4.  Low-order  moments  of  W 

It  is  well  known  that  W  can  be  expressed  as  a  sum  of  indi¬ 
cators,  which  leads  to  expressions  for  its  moments  in  terms  of 
certain  probabilities.  V/e  will  now  record  convenient  formulas  for 
the  first  three  moments: 


(4.1) 


E(W)  ■  [£  q,  *  (n-l)  +  P]  *  n  , 


V*r(W)  -  ([n>23*(n-2)  +  n»22](n-l)  +  m2l)n  • 
H3(W)  -<([n>34  (n-3)  +n»33](n-2)  +  n^Cn-l)  + 

®2i  "  P(l-P), 

®22  "  (P-<li)2  +  3q1(l-q1)/2, 

”23  “  q2  *  ql> 
m^!  -  p(l-p)*(l-2?)  , 

m32  -  6q“  (l^-q^  -  6pq1(l-q1) 

+  3pk,(X  +  3qx)  +  ‘|q1*  (i-q^  *  (l-2qx)  , 

®33  *  6q2  +  3q2  (2-3p-3qx)  +  6q[  (p-2qx) 

+  q^  (12p  +  8qx  -  3)  +  p3, 

“*34  -  q3  +  3r  -  9qxq2  +  5q 

p  -  P(XX  <  0)  -  G(0) 

qk  -  Coo  S(x)Gk(-x)dx 

%  "  p(Z2,X3,’,,»Xk+l  «  “Xl’  X1  «  °) 

*  /»•  GV*) 

%  "  ^(X  2*^3 » *  *  *  ,ZU+1  <  ”X1*  X1  >  ^ 

-  /q  Gk(-x)g(x)d:, 

h(x)  -  g(x)G(-x), 
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(*.16)  H(x)  -  /*„  h(u)du, 

(4.17)  r  -  /"„  h(x)H(-x)dx. 


Notice  that  only  univariate  numerical  integration  is  required  by 
r.ny  of  theae  formulas,  and  that  all  values  of  n  are  dealt  with 
simultaneously.  We  give  as  Table  3  the  values  of  the  coefficients 
for  the  first  three  moments  for  the  normal  distribution  *V(u,l) 
;;ith  unit  variance  and  expectation  p  ,  for  some  of  the  values 
of  p  considered  by  Klotz. 

5.  The  normal  approximation 

It  follows  from  the  work  of  Hoeffding  [4]  that,  as  n  -*•  »  , 


tend  to  thc  stfln<iard  normal  distribution.  At 
first  glance,  one  might  hope  that  this  fact,  combining  with  small- 
s ample  integrations  by  the  method  of  Klotz,  would  solve  the 
problem.  Unfortunately,  it  appears  that  the  normal  approximation 
is  still  very  bad  at  the  practicable  limits  of  the  integration 
approach.  This  is  illustrated  in  the  fourth  column  of  Table  4, 
for  a  sample  of  n  ■  10  drawn  from  <//(  1,1).  The  results  of 
Klotz1 s  integrations  are  compared  with  this  normal  approximation 
(with  continuity  correction)  for  E(W)  ■  5.125  785  and  Var  (W)  - 
23.847  590,  v/hich  are  the  moments  found  by  using  Table  3  and 
formulas  (4.1)  and  (4.2).  The  maximum  error  of  the  cumulative 
form  of  the  normal  approximation  is  -0.1004  . 
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The  reason  for  Che  poor  results  Is  apparent  if  one  glances 
at  Figure  1,  which  shows  the  distribution  of  W  as  a  histogram. 

The  stupe  is  quite  unlike  that  of  a  normal  curve,  and  indeed, 
without  Riots1 s  work,  the  mere  values  of  E(W)  and  S.D. (W) 
could  have  told  us  that  this  must  be  so.  Since  W  is  nonnegative, 
it  cannot  be  nearly  normal  unless  E(W)  is  (say)  at  least  2.5 
times  as  large  as  S.D.  (W) .  From  (4.1)  and  (4.2),  it  can  be 
shown  that  this  will  not  occur  until  n  reaches  67.  The  same 
general  result  is  found  for  other  population  shapes  than  the 
normal.  Therefore,  this  reveals  that  the  normal  tendency  of  W 
does  not  take  effect  until  n  is  much  larger  than  can  practically 
be  dealt  with  by  integration.  It  is  of  course  possible  that  some 
other  "smooth"  approximation  way  be  found  to  give  much  better 
results.  However,  inspection  of  Figure  1,  and  similar  figures 
for  other  cases,  does  not  encourage  one  to  hope  for  good  small- 
sample  results  with  approximations  based  on  Edgeworth  expansions 
or  on  the  Pearson  family  of  curves.  We  somehow  need  a  method  of 
approximation  that  deals  with  the  irregularities  of  the  null 
distribution  o c  W. 

6 .  The  average  probability  method 

Figure  1  shows  that,  for  the  case  of  a  sample  of  10  from 
,/T(l,l),  P(W  -  w)  tends  to  decrease  as  w  increases,  but  in  a 
rather  irregular  way.  This  irregularity  can  be  explained  by  the 
irregularity  of  #(w)  (Table  1).  Thus,  for  each  of  the  values 
w  ■  0,  1  and  2,  there  i&  only  one  sign  pattern,  and  these 


Figure  2:  Average  probabilltlea  of  W  from  ^(1,1), 


avorago 
probability 
of  W  ■  w 


probabilities  decrease  smoothly.  At  w  ■  3,  there  ere  however 
two  sign  patterns,  which  "explains"  why  P(W  -  3)  exceeds  P(W-2), 
rather  than  continuing  to  decline. 

Let  us  consider  P(W  ■  w)/#(w),  the  average  probability  of 
the  #(w)  sign  patterns  corresponding  to  W  -  w.  As  Figure  2 
shows,  these  average  probabilities  do  decrease  in  a  rather  smooth 
way,  at  something  like  an  exponential  rate.  This  suggests  that 
we  write 

(6.1)  L(w)  -  loge[P(W  -  w)/#(w)  ], 

and  Figure  3  confirms  the  very  smooth  behavior,  at  least  in  this 
example,  of  L  as  a  function  of  w. 

We  now  have  an  idea  cor  a  method  of  approximation  to  the 

distribution  of  W.  The  values  of  L  at  the  two  extreme  points 

are  known:  W  ■  0  if  and  only  if  all  the  n  observations  are 

positive,  hence,  we  have  L(0)  ■  n  log  (1-p)  ,  since  #(0)  ■  1 

and  the  probability  of  a  positive  observation  is  1-p  by  (4.11)  ; 

similarly,  L(w  )  -  n  log  p  .  Dy  assuming  that  L(w)  behaves 

smoothly  between  these  terminal  values,  we  can  interpolate  some 

appropriate  smooth  function,  say  l'  .  If  L*  is  close  to  L, 

th.n  P  (w)  -  #(w)«e^.  will  be  close  to  P(w) .  Various  methods 

of  fitting  L1  to  l  are  possible.  We  have  investigated  the  use 

of  the  known  moments  of  W  discussed  in  Section  4.  The  simplest 

functions  with  the  required  values  at  w  ■  0  and  w..  are  the 

M 
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polynomial!.  Of  course,  some  other  smooth  functions  might  be 
more  appropriate  In  some  cases.  We  shall  however  couch  our  dis¬ 
cussion  In  terms  of  polynomial  approximations  which  give  satis¬ 
factory  results  for  all  cases  we  are  going  to  consider  In  the 
next  few  sections. 

We  shall  denote  by  L^,  the  polynomial  of  degree  k,  fitted 

to  L  by  the  requirements  (1)  that  1.^(0)  -  L(0)  and  Lk(wM)  ■ 

,  and  (2)  that  the  corresponding  probabilities  P^w)  ■ 

J-k(w) 

#(w)e  have  the  same  moments  as  W  of  orders  0,l,***,k-2. 

We  thus  Impose  k  -f  1  conditions,  corresponding  to  the  k  +  1 
coefficients  of  a  polynomial  of  degree  k. 

First,  let  us  conolder  the  linear  interpoland,  L^w)  - 
L(0)  +  ajW  ,  where 

(6.2)  ax  -  [L(wfr.)  -  L(0)  ]/wM  -  iL  log  . 

M 

However,  this  interpoland  will  give  probabilities  P^(w)  which 
In  general  do  not  add  up  to  1.  A  proper  distribution  requires 
une  of  the  quadratic  interpoland,  which  we  shall  write  as 

L2(w)  -  Mw)  +  *2  •  w  •  (wM  -  w). 

Clearly,  Pg(w)  is  monotonely  increasing  In  a2,  so  there 
exists  a  unique  value  of  a2  for  which  IP2(w)  “  1*  Since  the 
value  of  p  determines  Lp  which  in  turn  determines  a,  end 
hence  For  any  given  n,  P2  is  a  one-parameter  family  of 

distributions,  governed  only  by  the  parameter  p,  regardless  of 
what  the  particular  underlying  distribution  G  is. 


n 


Table  5  shows,  for  n  -  10  and  for  several  values  of  p, 

Che  coefficients  of  Lg,  and  the  expectation  ZwP2(w)  of  the 
resulting  distribution  Pg.  In  any  given  case,  one  may  compare 
this  expectation  of  the  approximating  Pg  with  the  true  expectation 
E(W) ,  as  glvan  by  (4.1),  in  order  to  help  in  judging  the  adequacy 
of  the  approximation  Pg. 

As  an  Illustration,  let  us  consider  a  sample  of  10  from 
,V(1,1).  From  a  normal  table  one  reads  that  p  -  .1587.  Hence, 
from  (6.2),  a^  -  -.303321,  also  a2  is  uniquely  determined, 
giving  "  .001938.  The  resulting  probabilities  P?  have 
errors  as  shown  in  column  5  of  Table  4.  P2  is  substantially 
better  than  the  normal  approximation,  having  a  maximum  error  only 
12  percent  as  large.  Of  course,  we  know  the  error  of  P2  only 
because  the  Klotz  computations  are  available.  However,  even 
without  these  we  could  have  considerable  confidence  in  P? 
because  of  the  readily  available  moment  check.  Interpolating 
the  fifth  column  of  Table  4  shows  that  the  expectation  of  P2 
is  5.04,  which  is  fairly  close  to  the  correct  value  E(W)  •  5*13 
obtained  from  (4.1). 

In  seeking  a  still  better  approximation,  it  is  natural  next 
to  modify  by  adding  a  cubic  term,  to  obtain  an  approximation 
with  the  correct  expected  value,  say 

L3(w)  -  L2(w)  +  a^.w  (wM -w)  •  (w  -  b^)  . 
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The  proper  velues  of  and  ere  determined  by  the  require¬ 
ment#  that  lP3(w)  -  1  and  IwP3(w)  -  E(W) ,  which  may  be  found 
numerically  by  inspection.  (We  have  not  proved  the  existence 
and  uniqueness  of  a^  and  b^,  but  conjecture  theoe  properties 
will  hold  in  all  reasonable  cases.  At  least  in  all  of  our  trials 
the  convergence  to  the  fitted  values  proceeded  smoothly  and 
without  difficulties.) 

The  search  for  a^  and  is  aided  by  knowing  where  to 

start.  Let  us  suppose  that  the  approximation  is  a  good  one, 

so  that  only  a  small  cubic  correction  is  needed,  and  accordingly 
a^  is  small.  Then,  to  a  first  approximation, 

Vw)  *  Vw)  +  a3  P2^w)*w^wm  ”  w)'(w  "  b3^» 

and  this  approximation,  for  each  W  -  w,  h*s  error  due  to 
replacing  by  1  +  A(w)  for  small  A(w) ,  where 

A(w)  ■  a^  •  w  •  (w^  -  w)  •  (w  -  b^) .  If  we  now  impose  the 
requirements  that  ZPg(w)  ■  ZP3(w)  -  1  and  ZwP^fw)  -  E(W) , 
we  find  these  approximation  values  for  a^  and  b^: 

b*  -  Sj/Sq,  a*  -  [E;w)  -  SwP2]/(S2  -  b’Sj, 
where  S.  *  Zw^-fw)  •w  (w  -  w) .  Table  5  gives  the  values  of 

R  c  n 

*  # 

and  -  b^S^,  which  will  indicate  in  any  given  case  a  good 
point  for  starting  the  search  for  a^  and  b^.  We  have  written 
a  program  for  the  automatic  conduct  of  the  search.  Once  a^ 
and  b^  are  found,  it  is  easy  to  compute  and  hence  P^. 
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Table  *i  dhow*  tho  result*  of  this  method  in  our  normal  example 
Hero,  a*  ■  0*000  013  3185,  -  9.0378,  and  the  searched 

a^  ■  0.000  0  1  2  9195,  ■  9.1249,  and  the  agreement  between 

1*2  and  the  Klotz  values  is  better  than  that  of  P,,. 

The  distribution  will  by  definition  have  the  correct 

values  at  w  •  0  and  at  w  -  w.^,  and  the  correct  expectation. 

The  quality  of  such  an  approximation  may  be  judged  by  comparing 

2  2 

EW  with  Zw  F^w)  or  die  corresponding  variances.  In  this  case, 
the  variance  of  is  24.75,  where  the  correct  variance  of  W 
is  23.85.  If  the  agreement  i*  net  a 8  good  as  desired,  one  may  add 
a  quartic  term  to  L^,  getting 

L4 (w)  ■  L3(w)  +a4*w.(wM  -  w)*(w  -  b4)*(w  -  c4), 

where  a^,  b^,  are  determined  by  the  requirements  that  P^ 
have  the  correct  moments  of  orders  0,  1,  2.  In  the  cases  we 
have  examined,  the  quartic  correction  tends  to  be  small,  and  to 
a  good  approximation  with  error  discussed  above,  we  obtain  the 
trial  coefficients  given  by: 


7a"1— .-  -(Sg-sl  -  Vsj  +  [''Sj3,se  -  "sj-si3 

SSj  -  £S2S0  *  5  * 


I  I  }  IIP  I  I  t  I  i. 

+  (S0-S3)}  -  3(S2-S1)';  -  6S0  Sj  S2  S3]f) 


,  t  *  1 ,  ,,  1  *  1 , 

(S2  -  b4  .  S1)/(S1  -  b4  •  SQ) 


%  -  (EW2  -  Zw2P3(w))/(S2  -  (b*+c*)s|  +  b*  c*  s0) 
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where  ^  (w)  •  w  •  (w^  -  w) .  Unlees  one  is  lucky ,  the 

required  moments  corresponding  to  this  set  of  trial  coefficients 
need  to  be  adjusted.  One  may  make  slight  adjustments  if  these 
values  do  not  give  with  moments  sufficiently  'lose  to  the 
true  values  as  found  by  the  formulas  of  Section  4. 

The  approximations  Pg,  P  f  P^  are  successively  more 
accurate,  as  one  might  hope.  The  maximum  absolute  errors  are 
0.0120,  0.0083,  and  0.0047  respectively.  The  root  mean  square 
errors  for  1  *  w  2  20  decrease  also,  being  .0057,  .0041,  .0027. 

In  principle,  one  might  now  use  E(Vp)  to  add  a  quintic 
terra,  but  if  the  method  is  working  well  for  the  case  at  herd, 
the  agreement  between  P^  and  P^,  and  between  and 

Zw^P^(w),  should  indicate  that  the  P^  approximation  will 
serve.  There  is  an  intuitive  reason  for  thinking  that  P^  is 
a  natural  stopping  point.  Since  both  log  P(w)  and  log  #(w) 
are  nearly  quadratic  in  w  for  large  n,  it  follows  from  (6.1) 
that  L(w)  will  also  be  nearly  quadratic  in  the  interval  near 
E(W)  containing  most  of  the  probabilities.  For  a  polynomial  to 
accord  with  this  shape,  and  also  have  specified  values  at  the 
"distant"  points,  w  -  0  and  w^,  it  requires  five  degrees  of 
freedom  of  a  quartic.  Thus,  one  may  expect  the  approximation  P^ 
to  agree  with  the  asymptotic  normal  approximation  in  large  samples, 
whereas  could  not  be  in  general  expected  to  do  this. 


hi»h«^pw  W'l  »S'  iff"  ■' ■■""■>,» '»>»■*«»" 
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Table  4  shows,  for  our  normal  example,  the  excellent  awro...>- 
of  with  the  results  of  the  Klotz  computations,  with  the 
maximum  error  of  P^ -cumulative  is  0.00^7.  In  addition, 

Iw^P^(w)  -  642.4183,  so  that  P^  has  standardized  third  moment 
1.211,  compared  with  the  value  1.268  for  W.  This  shows  that  the 
skewness  of  this  approximating  P  seems  to  be  about  right.  While 
computations  like  those  of  Klotz  will  seldom  be  available  for  a 
check,  this  third-moment  check  can  be  made  in  general,  as  can  the 
reassurance  of  finding  P^  close  to  P^. 

We  have  also  tried  out  the  average-probability  method  on 
the  computations  of  Arnold,  for  a  sample  of  size  n  -  10  from 
a  re-scaled  t-distribution  with  four  degrees  of  freedom,  and  for 
shift  u  ■  1.0.  By  examining  the  probabilities  published  by 
Arnold,  we  concluded  that  for  w  fc  3,  they  appear  to  have  a 
smooth  behavior.  Hence,  we  applied  our  method  to  approximate 
distribution  of  W  for  3  »  w  *  55.  For  w  <  3,  one  can  always 
ure  the  formulas  stated  in  Section  7.  The  results  are  again  good 
for  P^,  although  not  as  good  as  in  the  normal  example.  The 
reason  appears  to  be  the  exceptionally  heavy  tail  of  t^.  The 
maximum  difference  between  and  P^  is  0.0017,  where  the 

maximum  error  of  the  P^ -cumulative  is  0.008  from  Arnold's  results. 

It  is  also  necessary  to  point  out  situations  when  our 
approximation  does  not  seem  to  wer’  too  well.  One  intuitive  case 
we  have  at  hand  is  t,  with  swrple  size  n  »  10  and  shift  n«1.0. 
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After  reviewing  tome  of  the  probabilities  published  by  Arnold, 
we  realize  that,  with  its  extremely  heavy  tall,  the  assumption  we 
mr.de  for  our  method,  that  L  behaves  smoothly  between  the  two 
extremes,  is  not  satisfied.  As  mentioned  by  Arnold,  there  is 
almost  a  complete  breakdown  in  order  for  the  case  of  £  degree 
of  freedom  in  t-distribution.  For  instance,  with  w  ■  9,  #(9)  *8, 
there  is  one  dominating  factor  corresponding  to  the  sign  pattern 
(+,  +,  +,  +,  +,  +,  +,  +,  +)  with  probability  0.0490;  when 

w  "  10,  #(10)  »  10,  with  dominating  probability  0.1793,  and 

when  w  ■  11,  #(11)  •  11,  the  dominating  probability  is  only 

0.0238.  This  Irregularity  of  course  lies  on  the  fact  that  t^ 
has  an  extraordinary  heavy  tall,  and  this  leads  us  to  believe  that 
the  average-probability  method  will  not  perform  well  in  such  cases. 
7.  An  additional  check 

While  computations  by  numerical  Integration  of  the  entire 
distribution  of  W  Is  seldom  available  and  not  cauily  done.  It 
Is  not  too  difficult  to  get  correct  probabilities  for  small  values 
of  W.  One  can  easily  express  these  probabilities  as  univariate 
Integrals: 

(7.1)  P(W  -  1)  -  n  !°x  g(x)  •  (1  -  G(-x)  f1  dx 

(7.2)  P(W  -  2)  -  n(n-l)  J0_"  g(x)-[G(-x)  -  C(0)  )•  (1  -  G(-x)  ]n"ndx 

P(W-  3)  -  n<n-1^n-2)/°„g(X)  •  tc(-x)  -0(0)  ] -.  [(l-C(-x)  )n'3  dx 
+  n(n-’)/°>i  £(x)[C(0)  -  c(x)]-[l  -  G(-x)  )n'2  dx. 


(7.3) 
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w«  have  found  by  numerical  integration  for  the  cate  of  Table  4 
that 

P(W  -  1)  -  .11365 
P(W  -  2)  -  .07501 
P(W  ■  3)  -  .10360 

which  verify  the  results  obtained  by  Klotz. 

The  probabilities  for  small  w  can  be  used  to  check  on  the 

adequacy  of  the  L>approxlmations.  Alternatively,  they  can  be  used 

to  permit  L  to  be  interpolated  between  w  -  3  and  wu  rather 

than  w  -  0  and  w^.  As  in  the  t^-case  of  the  preceding  section, 

this  is  likely  to  be  useful  in  irregular  situations. 

In  a  similar  way,  P(W  -  w)  for  4  *  w  3  8  can  be  computed 

by  bivariate  integrals,  and  then  L  could  be  interpolated  between 

w  -  8  and  w  ■  wVf. 

rl 

8.  Normal  camples  of  moderate  size 

The  Pitman  analysis  shows  that,  with  very  large  normal 
samples,  the  relative  efficiency  of  the  Wilcoxon  test  to  the 
t-teat  is  3/x  «  0.955.  Based  on  his  computations  for  5  s  n  *  10 , 
Klotz  found  that  efficiencies  lie  in  the  range  of  (.955,  .986) 
for  .01  *  a  £  .10  .  (The  efficiencies  were  lower  for  very 
E'u"ll  a.)  Those  facts  have  led  to  a  widespread  belief  that  the 
efficiency  of  the  Wilcoxon  toot  is  high  in  all  normal  cases  when 
the  value  of  a  is  not  too  small.  There  has  of  course  remained 
the  possibility  that  the  efficiency  was  less  good  for  moderate 
values  of  n  than  for  values  at  either  extreme. 
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To  throw  tome  light  on  this  question,  we  have  applied  the 
average-probability  method  to  a  sample  of  n  ■  20  from  (•75,  1) . 
This  computation  requirea  #(w)  for  n  -  20  recorded  in  Table  2. 
We  found  that  L(0)  -  -5.139  885,  L(210)  -  -29.688  965, 
ax  -  -0.116  900  38,  a2  -  -.000  135  9136,  a3  -  -.000  000  2592, 
b3  -  40.0236,  a^  -  -.000  000  003  484,  b^  -  26.6508, 

■  62.8990.  Table  6  shows  excerpts  from  the  cumulative  forms 
of  the  distribution  P^  and  P^.  The  good  Agreement  of  these 
approximations,  together  with  the  satisfactory  agreement  of  the 
standardized  third  moment  for  P^  (0.644)  and  W( 0.664)  lends 
confidence  to  the  results. 

As  on  additional  check,  we  hose  evaluated  (7.1),  (7.2)  and 
(7.3),  with  these  results: 


w 

P(W  - 

w) 

by  integration 

approx.  P4 

1 

0.0050 

.0051 

2 

0.0043 

.0044 

3 

0.0074 

.0076 

From  this  P^ -approximation,  we  find  the  following  efficiencies 


for  n  -  20: 


w 

a  m 

y-i  *  w] 

0  - 

PA[W  >  w] 

0  for 

n  ■  19 

t-test 

n  -  20 

Relative  eff 

of  W  to  t 

45 

0.01198 

0.20994 

0.21888 

0.19289 

0.9672 

52 

0.024  22 

0.12760 

0.13251 

0.11415 

0.9634 

61 

0.05270 

0.06153 

0.06335 

0.05310 

0.9588 

66 

0.07682 

0.03938 

0.04032 

0.03328 

0.9567 

69 

0.09467 

0.02972 

0.03049 

0.02482 

0.9568 

9.  The  Wllcoxon  diotrlbution  in  a  heavy-tailed  case 

Many  statisticians  consider  that  actual  distributions  found 
in  practical  work  tend  to  resemble  a  normal,  except  that  in  some 
cases  their  tails  are  heavier  than  the  very  exiguous  normal  tails, 
corresponding  perhaps  to  an  occasional  gross  error.  Such  a 
departure  from  the  normal  shape  can  Increase  the  population 
variance  substantially,  to  the  severe  detriment  of  the  large- 
sample  performance  of  the  t-test.  On  the  other  hand,  this  sort 
of  departure  from  normality  will  have  very  little  effect  on  the 
Integral  of  the  square  of  the  population  density,  which  governs 
the  large-sample  performance  of  the  Wllcoxon  test.  Accordingly, 
the  Pitman  analysis  shows  that  Wllcoxon  can  be  substantially 
superior  to  t  in  such  cases. 

This  is  of  course  pertinent  only  when  the  sample  is 
sufficiently  large.  If  the  sample  is  small,  it  would  seem  quite 
likely  that  none  of  the  sample  values  will  come  from  the  tails, 
so  that  in  effect  one  is  sampling  from  the  "normal"  part  of  the 
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population.  In  that  case  the  t-test  should  be  superior.  It  is 
natural  to  pose  a  question:  how  large  must  the  sample  be  before 
the  heavy  tails  can  exert  their  baleful  effect  on  t  relative 
to  W?  An  investigation  of  this  important  question  has  been 
hindered  by  the  fact  thAt  it  is  difficult  to  calculate  the  power 
of  either  test.  The  average-probability  method  permits  one  to 
obtain  a  reasonably  good  approximation  for  the  power  of  W. 

We  present  some  results,  partly  in  the  hope  that  they  may  stimulate 
someone  to  think  of  a  good  way  to  do  the  same  for  the  t-tost,  so 
that  the  comparison  may  be  completed. 

Let  us  take  as  our  heavy-tailed  distribution  a  Tukey  model 
consisting  of  a  blend  of  97#  from  and  3#  from  a  normal 

with  the  same  expected  value  1,  but  with  standard  deviation  4. 

For  a  sample  of  n  •  10  from  this  distribution,  we  find  that 
L(0)  -  -1.814  432,  L(55)  «  -17.961  627,  ax  -  -0.293  585  36, 

a2  -  -.001  785  78,  a3  «  0.000  049  611  2,  b3  -  9.8276 
a^  -  -.000  003  6214,  b4  -  6.2444  and  c^  -  18.6004.  Table  7 
chows  the  cumulative  forms  of  the  distributions  of  P3  and  P^. 
Agreement  is  again  good,  also  for  the  standardized  third-moment 
check,  with  1.122  for  P^  and  1.156  for  W. 

As  an  additional  check,  we  used  the  integrals  of  Section  7: 

_ P(W  *  w) _ 

w  by  integration  by  P^ 

1  .1043  .1058 
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All  checks  indicate  that  our  P^ -approximation  is  performing  veil 
in  this  heavy-tailed  case. 

With  this  blending  of  a  Tukey  model,  the  number  of  gross 
errors,  G,  is  binomial  (n  -  10,  p  ■  0.03)  where  P(G  -  0)  - 
0.737  **24.  In  other  words,  it  may  be  quite  likely,  with  probability 
0.737  424,  that  all  the  10  sample  values  will  be  from  pure  <y(l,l)  , 
for  which  case  we  have  discussed  in  Section  6.  One  interesting 
point  here  is  that,  by  subtracting  this  out,  we  are  able  to  get 
the  conditional  distribution  of  W  given  G  >  0.  That  is, 
knowing  that  some  gross  error  occurred  in  this  sample  of  10, 

P<W  “  "1°  "  °>  ■  ■^Sr575[PpUre^(1,l)(H  -  ») 

-  .737  4*4  •  P4  (W  -  w)  ). 

10.  Robustness  of  the  Wllcoxon  test  against  asymmetry 

The  Wllcoxon  one-sample  test  is  intended  to  test  the  location 
of  the  center  of  symmetry  of  a  symmetric  population.  Any  actual 
population  will  be  asymmetric,  at  least  to  some  extent.  It  io 
accordingly  important  to  know  how  robust  the  test  is  against 
asymmetry.  That  is,  we  need  to  find  out  how  the  actual  significance 
probability  compares  with  the  nominal  value  given  by  the  null 
distribution,  in  case  the  population  is  centered  at  zero  but  lc 
moderately  asymmetric.  The  average-probability  method  can  throw 
some  light  on  this  important  o.uestion. 

Before  this  problem  can  be  tackled,  it  is  necessary  to 
decide  what  is  meant  by  the  "center"  of  an  asymmetric  population. 
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The  concept  of  center  la  naturally  related  to  the  statistical 
tool  being  used.  If  one  works  with  the  sample  mean,  then  the 
population  mean  Is  the  natural  center,  at  least  with  large  samples; 
similarly,  the  sample  median  calls  for  the  population  median;  and 
so  forth.  What  Is  the  "Wllcoxon  center"  of  a  population? 

Let  us  seek  to  define  center  In  such  a  way  as  to  promote 
the  robustness  of  the  significance  probability.  That  is,  we 
want  the  distribution  of  W,  when  the  population  is  "centered" 
at  0,  to  resemble  the  null  distribution  in  Section  2.  This 
distribution  is  (except  for  very  small  n)  nearly  normal ,  so 
presumably  W  will  continue  to  be  something  like  normally  dis¬ 
tributed  for  mildly  asymmetric  populations.  To  keep  the  distri¬ 
bution  of  W  nearly  the  same,  we  try  to  keep  its  location  and 
scaling  nearly  the  same. 

In  the  symmetric  case,  if  the  null  hypothesis  is  true,  then 
q^  •  £  and  p  ■  i,  so  that 

E(W)  -  [£  qx(n-l)  +p]  •  n  -  £  n(n+l) . 

One  may  seek  to  define  center  in  general  so  that,  under  the  null 
hypothesis,  one  will  continue  to  have  -  £  and  p  ■  £,  and 
thus  W  will  continue  to  be  centered  at  £■  n(n+l)  ,  as  in  the 
symmetric  case.  Unfortunately,  it  turns  out  that  this  is  in 
general  impossible:  if  ve  locate  the  population  so  that  q^  «  £, 
then  p  will  differ  at  least  slightly  from  £,  and  vice  versa. 
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At  least  when  n  la  large,  the  dominant  term  of  E(W)  la  tho 
one  Involving  q^.  We  are  led  to  the  idea  of  defining  center  to 
that,  when  the  population  la  centered  at  0,  q1  •  £.  Recalling 
that  is  the  probability  that  -f  <  0  or  equivalently 
that  ^(Xj  +  X2)  <  0 ,  we  see  that  this  event  will  have  probability 
£  if  the  median  of  tlic  distribution  of  £(X^  4*  X2)  is  located 
at  0.  We  are  led  to  define:  The  Wilcoxon  center  of  a  population 
is  the  median  of  the  moan  of  two  observations  therefrom.  We  note 
that  in  the  symmetric  case,  this  definition  yields  the  conventional 
center  of  symmetry. 

In  general,  with  this  definition,  p  will  differ  slightly 
from  £  for  moderately  asymmetric  populations,  and  hence  E (V/) 
will  not  quite  coincide  with  £  n(n+l) ,  though  it  will  be  so 
very  near  if  n  is  large.  However,  the  only  way  to  force 
E(W)  »  £  n(n+l)  would  be  to  have  the  definition  of  the  center 
of  the  population  depend  on  the  size  of  the  sample,  and  that 
would  be  peculiar. 

Let  us  illustrate  these  ideas.  Consider  the  chi-square 
distribution  with  10  degrees  of  freedom,  depicted  in  Figure  4. 

This  population  has  the  sort  of  moderate  skewness,  v;ith  standardize', 
third  moment  of  0.894  427,  that  might  be  encountered  in  practice 
in  cases  when  the  population  was  thought  to  be  symmetric.  If 
Xj  and  x2  a*e  observations,  independently,  therefrom,  then 
X1  +  x2  has  the  chi-square  distribution  with  20  degrees  of  freedom 
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whose  median  Is  19.337*  Thus,  the  median  of  the  distribution  of 

Ji(X1  +X2)  is  9.6685,  that  is,  P(^(X1  +X2)  <  9.6685)  -  i, 

2 

nnd  the  Wilcoxon  center  of  *8  at  9.6685.  For  comparison, 

the  median  is  9*6685,  the  mode  is  8.00,  and  the  expectation 
is  10. 0,  as  illustrated  in  Figure  4.  If  we  translate  the 


Figure  4 

population  to  bring  the  VJilcoxon  center  to  0,  then 
p  -  P(X1  -  9.6685  <  0)  ■  0.53.  Accordingly,  under  the  null 
hypothesis  that  the  Wilcoxon  center  is  at  0, 

E(W)  -  *(n-l)«n  +0.53  n  -  £  n(n+l)(l  + 

at  n  -  10,  we  get  E(W)  ■  27.8  instead  of  the  desired  27.5. 
Since  S.D.  (W)  is  approximately  9.8526,  the  discrepancy  of  0.3 
in  the  expected  value  of  W  is  not  especially  important,  and 
it  would  be  even  less  so  as  n  is  increased  above  10. 

The  magnitude  of  the  departure  of  p  from  £  in  typical 
c£se n  can  be  assessed  by  examining  the  Edgeworth  expansion: 

p  »  P[XX  <  rued  (£  (Xj  +  Xg))  ]  -  Fz(a) 

"  ♦(*)  -  n3^  (z)  +  (*)  +  i  (z) ) 

-  [t)5v^(z)  +  n3*ri4<»'-6^ (z)  +  z  (z)  3  +  **• 
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where 

med.  -  EX^ 

*  "  T.D.fX^"  * 

In  order  to  determine  how  far  out  should  one  carry  for  this 
expansion,  one  needs  to  examine  the  value  determined  in  each 
batch  ([  ])  ,  which  contains  terms  of  the  same  order  of  convergence. 
If  the  value  of  a  certain  batch  is  small  compared  to  the  sum  of 

the  previous  terms,  one  need  not  go  any  further.  To  illustrate 

2 

this  point,  let  us  work  out,  for  example,  x^q  with  n  -  10: 

Z  m  -0.074  126 

where  the  sura  of  the  fir  at  4  terms  sum  up  to  0.529  918.  Adding 
in  the  next  three  terras  will  give  p  »  0.530  418.  For  our  purpose, 
p  ■  0.53  will  serve. 

Let  us  now  return  to  the  robustness  question.  Suppose  we 
have  been  given  a  specific  asymmetric  distribution,  and  have 
translated  it  to  bring  the  Wilcoxon  center  to  0.  Tills  will  make 
■  £  and  will  yield  a  specific  value  of  p.  Accordingly,  for 
n  given  sample  size,  E(W)  is  determined.  We  may  therefore  apply 
the  average  •probability  method  to  yield  an  approximating  distribu¬ 
tion  P^.  Therefore,  we  have  in  a  one-parameter  family  of 
dictributiono  determined  by  the  parameter  p.  By  the  formulas 
of  Section  4,  we  can  calculate  Var(W)  and  Thio  permits 

the  refinement  of  P^  to  P^ ,  and  a  third-moment  check  on  the 


adequacy  of  the  P^  approximation. 
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o 

We  have  carried  out  thin  process  for  the  X}o  oxamplo,  for 
n  «  10: 

E(W)  -  (.249966  (n-1)  +  .529958)  •  (n)  -  27.8 

Var(W)  -  [(.084377  (n-2)  f  .375899)  (n-1)  +  .249103  ]«n 
-  97.0735 

n3(W)  ■  ([(.009280  (n-3)  +  .037334)  (n-2)  -  .000113] 

.  (n-1)  -  .014925}  •  n 

-  73.49325 

with  standardized  third  moment  0.076842.  We  also  found  by  the 
average -probability  method, 

L(0)  -  -7.550  226,  L(55)  -  -6.348  783 

ax  -  0.021  844  418,  *2  -  -.000  00?  4499, 

a3  -  -.000  037  473  2,  b2  «  28.32399 

a4  -.000  OOO  018  749,  b4  -  19.33199,  c4  -  36.44021. 

Table  8  shows  excerpts  of  the  P4-cumulative.  At  a  first  glance 
et  some  of  the  values  shown,  say,  if  you  will  reject  if  W  «  9, 
then  the  significance  probability  shown  by  this  approximation  is 
2.557^  ,  v/here  the  nominal  value  given  by  the  null  distribution 
is  3 . 223;'.  This  does  nor  sound  too  appealing. 

However,  it  is  amazing  how  good  the  result  will  be,  if  one 
is  dring  a  two-tailed  test.  For  example,  if  one  is  willing  to 
reject  if  W  -  9  or  VI  ■  46,  the  nominal  value  will  give  6J;46^, 


and  the  significance  probability  will  add  up  to  6.609$.  The 

reason  that  neither  of  the  one-tailed  tost  la  too  encouraging  la 

2 

due  to  the  ekewneta  based  on  this  asymmetry  distribution 
But  when  one  Is  doing  a  two-tailed  test,  the  significance  prob¬ 
abilities  of  both  sides  seem  to  balance  out.  Thus,  one  may 
conclude  that  tho  robustness  property  of  Wilccocon  test  agalust 
asymmetric  distribution  owns  a  much  more  sound  evidence  in  the 
case  of  two-tailed  test  than  either  tall. 


Table  2.  ff{ w)  and  approximation  (2.1)  for  n  •  20 


w 

#(«) 

w  ^(w) 
approx. 

w 

#(«) 

w  #(w) 
approx. 

w 

#(w) 

w  #(w) 
approx. 

21 

75 

75.3 

51 

2131 

2129.1 

81 

10538 

10537.7 

22 

87 

87.8 

52 

2300 

2299.0 

82 

10864 

10864.8 

23 

101 

102.0 

53 

2479 

2478.0 

83 

11186 

11186.9 

24 

117 

118.0 

54 

2668 

2666, 2 

84 

11504 

11503.0 

25 

135 

136.0 

55 

2865 

2863.6 

85 

11812 

11812.3 

26 

155 

156.3 

56 

3071 

3070.3 

86 

12113 

12113.7 

27 

178 

178.9 

57 

3288 

3286.4 

87 

12407 

12406.3 

28 

203 

204.2 

58 

3512 

35H.7 

88 

12689 

12689.3 

29 

231 

232.4 

59 

3746 

3746.2 

89 

12961 

12561.7 

30 

263 

263.7 

60 

3991 

3989.8 

90 

13224 

13222.6 

31 

297 

298.3 

61 

4242 

4242.3 

91 

13471 

13471.1 

32 

335 

336.5 

62 

4503 

4503.6 

92 

13706 

13706.4 

33 

378 

378.6 

63 

4774 

4773.4 

93 

13929 

13927.8 

34 

424 

424.8 

64 

5051 

5051.4 

94 

14134 

14134.5 

35 

475 

475.5 

65 

5337 

5337.2 

95 

14326 

14325.7 

36 

531 

530.8 

66 

5631 

5630.6 

96 

14502 

14500.8 

37 

591 

591.3 

67 

5930 

5931.0 

97 

14639 

i;.C59.1 

33 

657 

657.0 

68 

6237 

6238.0 

90 

14800 

14800.2 

39 

729 

728.3 

69 

6551 

6551.1 

99 

14925 

1493.6 

40 

806 

805.6 

70 

6869 

6869.7 

100 

15029 

15028.7 

41 

889 

889.I 

71 

7192 

7193.2 

101 

15115 

151 15. 2 

42 

980 

979.2 

72 

7521 

7521.0 

102 

15184 

15182.8 

43 

1076 

1076.0 

73 

7851 

7852.3 

103 

15231 

15231.3 

44 

1180 

1380.0 

74 

8I85 

8186.5 

104 

15260 

15260.4 

45 

1293 

1291.4 

75 

8523 

8522.8 

105 

15272 

15270.2 

46 

1411 

1410.4 

76 

0059 

8860.4 

47 

1533 

1537.4 

77 

9197 

9198.5 

i.  a 
*  . 

3674 

1672.5 

78 

9536 

9536.1 

1*5 

1317 

1816.1 

79 

9871 

9372.5 

30 

1969 

1968.2 

30 

10206 

10206.7 

30 


Table  3.  Coefficients  of  first  three  moments  of  W 


(n,l)  H  -  0.25  (.25)  1.50 


coeffX  .25  .  50  .  75  1.00  1.25  1.50 


p 

.401 

293 

.291 

160 

.226 

627 

.158 

655 

.096 

800 

.066 

807 

«1 

.361 

835 

.234 

235 

.144 

422 

.078 

650 

.037 

574 

.016 

947 

°2l 

.240 

257 

.206 

386 

.175 

267 

.133 

484 

.087 

430 

.062 

344 

®22 

.347 

923 

.277 

200 

.192 

104 

.115 

097 

.058 

567 

.027 

476 

"23 

.075 

34o 

.056 

585 

.033 

799 

.016 

881 

.006 

946 

.002 

403 

"31 

.047 

430 

.086 

203 

.095 

827 

.091 

128 

.070 

504 

.054 

014 

“32 

.225 

539 

.345 

618 

.333 

059 

.243 

020 

.137 

157 

.070 

001 

"33 

.147 

244 

.201 

906 

.173 

162 

.103 

590 

.045 

998 

.017 

046 

"34 

.019 

875 

.030 

189 

.019 

515 

.009 

968 

.003 

798 

.001 

058 

32 


Table  5.  Given  n  -  10 


p 

*2 

E(W) 

• 

b3 

S2  -  tySj 

.02500 

-.01634063 

.31381 

1.83707 

22.15959 

.05000 

-.00904215 

.79815 

2.97823 

169.80828 

.07500 

-.00579759 

1.47716 

4.28975 

620.71346 

. 10000 

-.00400116 

2.34955 

5.68875 

1562.50147 

.12500 

-.00288598 

3.39762 

7.11726 

3122.03702 

.15000 

-.00214043 

4.59610 

8.54549 

5343.75854 

.17500 

-.00161469 

5.91989 

9.96180 

8200.59779 

.20000 

-.00122953 

7.34483 

11.36261 

11609.44424 

. 22500 

-.00093940 

8.84901 

12.74807 

15449.32008 

.25000 

-.00071599 

10.41587 

14.12031 

19581.89779 

.27500 

-.00054129 

12.03147 

15.48157 

23859.44368 

.30000 

-.00040333 

13.68448 

16.83394 

28134.12860 

.32500 

-.00029388 

15.36601 

18.17928 

32264.89875 

.35000 

-.00020715 

17.06903 

19.51912 

36121.49424 

.37500 

-.00013905 

I8.78805 

20.85475 

39587.65504 

.40000 

-.00008661 

20.51871 

22.18717 

42562.86995 

.42500 

-.00004772 

22.25764 

23.51725 

44963.98652 

.45000 

-.00002075 

24.00459 

24.84574 

46729.73432 

.47500 

-.00000517 

25.75C43 

26.17310 

47801.99147 

33 


Table  6.  The  cumulative  probabilities  of  Approximating 
P^  and  from  *^0.75,  1)  with  n  -  20 


w  #(w)  P3(W  «  «)  P4(W  «  w) 


0 

1 

.0059 

.0059 

1 

1 

.0109 

.0109 

2 

1 

.0153 

.0153 

3 

2 

.0230 

.0229 

4 

2 

.0296 

.0295 

5 

3 

.0382 

.0381 

6 

4 

.0481 

.0480 

7 

5 

.0589 

.0587 

8 

6 

.0701 

.0699 

9 

8 

.0831 

.0828 

10 

10 

.0972 

.0968 

11 

12 

.1119 

.1113 

12 

15 

.1278 

.1272 

13 

18 

.1444 

.1437 

14 

22 

.1620 

.1611 

15 

27 

.1807 

.1798 

16 

32 

..2000 

.1990 

17 

38 

.2200 

.2188 

18 

46 

.2409 

.2397 

19 

54 

.2623 

.2610 

20 

64 

.2343 

.2829 

9 

.  1 

Table  7. 

Excerpt  of  the  cumulative  probabilities  of  the  \ 

approximating  and  from  Tukey’a 

model— a  blend  of  1 1 

0.57  from  ^(1,1) 

and  0.03  frome/^!,  ) ;  n  ■  10  1  j 

w 

m 

p3(w  a  w) 

yw  * ») 

— - - - 

. . . 

I  0 

i 

.1629 

.  1629 

1 

i 

.2707 

.2687  : 

2 

i 

.3426 

.3387 

3 

• 

2 

.4396 

.4330 

■i 

4 

2 

.5055 

.4973  1 

|  ; 

5 

3 

.5734 

.5642 

6 

4 

.6360 

.6266 

1  , 

7 

5 

.6906 

.6818 

8 

6 

.7367 

.7291 

9 

8 

.7803 

.7744  j 

10 

10 

.8192 

.8154  i 

. 

26 

39 

.9952 

.9977 

27 

40 

.9964 

.9984  i 

28 

40 

.9973 

.9989  j 

29 

39 

.9980 

.9993  I; 

30 

39 

.9985 

.9995  V 

31 

38 

.9989 

.9997  \\ 

32 

36 

.9992 

.9998  I 

33 

35 

.9994 

.9999  1 

34 

33 

.9996 

.9999  9 

35 

31 

.9997 

j 

0 

0 

0 

0 

• 

H 

Table  8,  P^ -cumulative  of  Wilcoxon 

o 

dletrlbution  from  x10  with  n  -  10 


w 

N(W) 

P^-cuo 

W 

m 

P^-cum 

0 

1 

.0005260 

40 

20 

.8928811 

1 

1 

.0010943 

41 

17 

.9094992 

2 

1 

.0017046 

42 

15 

.9243857 

3 

2 

.0030076 

43 

13 

.9375258 

4 

2 

.0043908 

44 

11 

.9488886 

5 

3 

.0065813 

45 

10 

.9594839 

6 

4 

.0096488 

46 

8 

.9682115 

7 

5 

.0136563 

47 

6 

.9749787 

8 

6 

.OI86585 

48 

5 

.9808340 

9 

8 

.0255651 

49 

4 

.9857195 

10 

10 

.0344671 

50 

3 

.9895591 

11 

11 

.0445238 

51 

2 

.9922548 

12 

13 

.0566837 

52 

2 

.9951082 

13 

15 

.0709878 

53 

1 

.9966265 

14 

17 

.0874601 

54 

1 

.9982513 

15 

20 

.  1070904 

55 

1 

1.0000000 

36 
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